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Abstract — We investigate the role of cooperation in wireless 
networks subject to a spatial degrees of freedom limitation. To 
address the worst case scenario, we consider a free-space line- 
of-sight type environment with no scattering and no fading. We 
identify three qualitatively different operating regimes that are 
determined by how the area of the network A, normalized with 
respect to the wavelength A, compares to the number of users n. 
In networks with \/A/X < yfn, the limitation in spatial degrees 
of freedom does not allow to achieve a capacity scaling better 
than yfn, and this performance can be readily achieved by multi- 
hopping. This result has been recently shown in Q. However, for 
networks with fA/X > yfn, the number of available degrees of 
freedom is min(n, y/A/X), larger that what can be achieved by 
multi-hopping. We show that the optimal capacity scaling in this 
regime is achieved by hierarchical cooperation. In particular, in 
networks with y/A/X > n, hierarchical cooperation can achieve 
linear scaling. 

I. Introduction 

Multi-hop is the communication architecture of current 
wireless networks such as mesh or ad hoc networks. Packets 
are sent from each source to its destination via multiple 
relay nodes. Each relay decodes the packets sent from the 
previous relay and forwards them to the next relay. Can 
more sophisticated cooperation between nodes significantly 
increase capacity of such networks? This is an important 
question concerning future communication architectures for 
such networks, and information theory has been brought to 
bear to try to shed some light on this question. Adopting the 
scaling law formulation of Gupta and Kumar 0~), much focus 
has been on the asymptotic regime where the number of nodes 
is large. Two diametrically opposite answers have emerged: 

• 1) Capacity can be significantly improved when nodes 
form distributed MIMO arrays via an intelligent cooper- 
ation architecture J3J, J4). The total degrees of freedom 
in the network is n, the number of nodes, and in regimes 
where power is not a limiting factor, the capacity can 
scale almost linearly with n. 

• 2) The total degrees of freedom in the network is not n 
but is actually upper bounded by yfn due to the spatial 
constraints imposed by the physical channel Q. Nearest- 
neighbor multi-hop is optimal to achieve this scaling QJ. 

This is no mathematical contradiction between these two 
sets of results. They are based on two different channel 
models. The key difference is the assumption on the phases 
of the channel gains between the nodes. 0, 2) assume that 



the phases are uniform and independent across the different 
channel gains. |7|, on the other hand, starts from physical 
principles and regards the phases as functions of the locations 
of the nodes. While the physical channel model used in Q 
is more fundamental, the i.i.d. phase model is also widely 
accepted in wireless communication engineering, particularly 
for nodes in far field from each other. Is there a way to 
reconcile the two sets of results? 

A deeper look at JT) provides a clue. The spatial degrees of 
freedom limitation in [7] is actually dictated by the diameter of 
the network rather than the number of nodes. More precisely, 
the spatial degrees of freedom in the network are limited by 
\fA/X, where A is the area of the network and A is the 
carrier frequency. This number can be heuristically thought 
of as an upper bound to the total degrees of freedom in the 
network as a whole and puts a limitation on the maximum 
possible cooperation gain. The conclusion that the capacity 
scales like yfn comes from the assumption that the density of 
nodes is fixed as the number of nodes n grows, so that \f~A/ A 
is proportional to yfn. But for actual networks, there can be 
a huge difference between \J~AjX and yfn. Take an example 
of a network serving n = 10, 000 users on a campus of 1 
km 2 , operating at 3 GHz: \f~AjX = 10000, while yfn is only 
100, two orders of magnitude smaller. So while multi-hop can 
achieve a total throughput of the order of 100 bits/s/Hz, there 
is still a lot of potential for cooperation gain, since the spatial 
degrees of freedom upper bound is 10, 000. 

So the ultimate cooperation gain is limited by \J~Aj A, while 
multi-hop performance depends on the number of nodes n only 
and not on sfAj A. But the number of nodes and the area are 
two independent parameters of a network, each of which can 
take on a wide range of values. To yield a complete picture 
of whether cooperation can help, the key is to remove the 
artificial coupling between these two parameters and analyze 
the capacity in terms of the two parameters separately. This is 
the goal of the present paper. We focus on a physical channel 
model similar to that used in 0, but with only a line-of-sight 
channel between each pair of nodes, a case in which spatial 
limitation is expected to be the most severe. Our main result is 
that in the regime when n and A/X are both large, the capacity 
of the network is approximately 



max yfn,min(n, — — ) 
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Accordingly, the optimal operation of the network falls into 
three different operating regimes: 

• 1) vA/X < \fn: The number of spatial degrees of 
freedom is too small, cooperation is useless and nearest 
neighbor multi-hopping is optimal. 

• 2) \/A/\ > n: The number of spatial degrees of 
freedom is n, cooperation is very useful, and the optimal 
performance can be achieved by the same hierarchical 
cooperation scheme introduced in (4). Spatial degree 
of freedom limitation does not come into play and the 
performance is as though the phases are i.i.d. uniform 
across the nodes. 

• 3) \fn < yA/X < n: The number of degrees of freedom 
is smaller than n, so the spatial limitation is felt, but larger 
than what can be achieved by simple multi-hopping. 
A modification of the hierarchical cooperation scheme 
achieves optimal scaling in this regime. 

Regime (1) is essentially the conclusion of [7J; regime (2) 
is essentially the conclusion of J4) (in the case when power 
is not a limiting factor). Thus, the validity of the results in 
these papers is not universal but depends on the relationship 
between n and AJX. The upper bound of s/A/ X on the spatial 
degrees of freedom of the network is already established by 
f7). The main technical contributions of the present paper are 
two-folded: 1) we show that there are actually min(n, s/A/X) 
spatial degrees of freedom available in the physical channel 
model when \f~AjX > \fn\ 2) we show that hierarchical 
cooperation can achieve these degrees of freedom. 

Both mathematically and philosophically, the present paper 
follows the same spirit of J5). G) advocates a shift of the "large 
networks" research agenda from seeking a single "universal" 
scaling law, where the number of nodes n scales with all 
systems parameters coupled with n in a specific way, to 
seeking a multi-parameter family of scaling laws, where the 
key parameters are decoupled and many different limits with 
respect to these parameters are taken. A single scaling law with 
a particular coupling between parameters is often arbitrary 
and too restrictive to cover the wide ranges that the multiple 
parameters of the network can take on. The specific parameters 
that were decoupled in Q were the number of nodes and 
the amount of power available. The current paper follows the 
approach of Q, but focuses on the number of nodes and 
the area of the network, while assuming there is a sufficient 
amount of power available that it is not limiting performance. 
A future goal of this research program is to investigate the 
dependence of the capacity on the number of nodes, the area 
of the network and the amount of power all together. 

II. Model 

There are n nodes with transmitting and receiving capa- 
bilities that are uniformly and independently distributed in 
a rectangle of area \[A x vA Each node has an average 
transmit power budget of P Watts and the network is allocated 
a total bandwidth of W Hertz around a carrier frequency of 
/, / ^> W. Every node is both a source and a destination 
for some traffic request. The sources and destinations are 



randomly paired up one-to-one into n source-destination pairs 
without any consideration on node locations. Each source 
wants to communicate to its destination at the same rate R 
bits/s/Hz. The aggregate throughput of the system is T = nR. 

We assume that communication takes place in free-space 
line of sight type environment and the complex baseband- 
equivalent channel gain between node i and node k is given 
by 

H-ik = VG (2) 

rik 

where is the distance between the nodes i and k and A is 
the carrier wavelength. Note that the locations of the users 
are drawn randomly but remain fixed over the duration of 
the communication. Therefore for a given realization of the 
network, the channel coefficients in (O are deterministic. 
The parameter G is given by the Friis' formula, 

Gtx ■ Gr x ■ X 2 



G = 



16tt 2 



where Gt x and Gr x are the transmitter and receiver antenna 
gains respectively. The discrete-time complex baseband signal 
received by node i at time m is given by 



Yi[m] 



E 

fc=l, k=£i 



H ik Xk[m\ + Zi[m] 



(4) 



where -X"fe[m] is the signal sent by node k at time m subject 
to an average power constraint 

E(\X k \ 2 )<P/W 

and Zi[m] is complex white circularly symmetric Gaussian 

noise of variance No- The model in (f2j), (O corresponds to 

free-space propagation. It is equivalent to the model in Section 

IV of J7) but with no scatterers. We consider the case of no 

scatterers since the spatial degrees of freedom limitation is 

expected to be most severe in this case. 

It has been shown in (5) that a wireless adhoc network 

is power-limited when the long-range SNR in the network is 

smaller than dB and the long range SNR has been identified 

as „ 

CP 

SNR; := n ^ ■ (5) 



N W {VA) a 

For the current case a — 2, which implies that SNR; = SNR S , 
where SNR S is the SNR in a point-to-point transmission over 
the typical nearest neighbor distance in the network. (See 
also (6).) In the present paper, our goal is to concentrate 
on the effect of the spatial degrees of freedom limitation 
on the capacity of wireless adhoc networks. To be able to 
solely concentrate on this factor, we assume there is no power 
limitation in our network. Formally, we assume that P and W 
are such that 

SNR* > dB, (6) 

for every A and n. For the current case of a = 2, the 
condition can be equivalently stated as SNR S > dB. When 
this condition fails to hold, the network becomes power limited 
and the behavior of the capacity as well as optimal operation 
can be significantly different. 
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Fig. 1 . Two square clusters of area A c separated by a distance d. 

III. Main result 

The main result of J7) is to show that under the network 
and the channel model described in the previous section with 
the additional restriction A = n, the capacity of the wireless 
network is upper bounded by 

T<i^i\Mlogn) 2 , 

with high probabilitjQ, where A'i > is a constant indepen- 
dent of n. Coupling the area of the network a priori with the 
number of nodes in the network is restrictive and does not 
allow to deduce the nature of the limitation imposed here. A 
relatively straightforward generalization of the analysis in Q 
gives the following result. Let us define the normalized area 
of the network with respect to the wavelength A as, 

A 

A a := Y2 . 

Under the network and channel model described in the pre- 
vious section, the capacity of the wireless network is upper 
bounded by 



T < 




if A > n 
if A < n 



'n (logn) 2 , y/A^ (logV^o") 
Ri (log n) 2 

with high probability where K\ > is a constant independent 
of n and Aq. For Aq < n, this result says that the maximum 
achievable capacity is of order s/n, which is achievable 
by a simple multi-hopping scheme JTj. For Ao > n, the 
achievability remains an open issue so far. 

The following theorem is the main contribution of the 
present paper. 

Theorem 3.1: Consider the network and the channel model 
described in the previous section and assume Ao > n, the 
total throughput achieved by hierarchical cooperation is lower 
bounded by, 

T > K 2 (min(n, y/A^j 

with high probability, for any e > and a constant K-2 > 
independent of n and Aq. 

The theorem can be interpreted as follows: When Aq > n 2 , 
hierarchical cooperation can achieve an aggregate through- 
put T > K 2 n x ~ e for any e > 0. When A < n 2 , 



hierarchical cooperation can achieve an aggregate throughput 
T > K2 A^ 2+ " . Note that this throughput is larger than y/n, 
when A > n. 



IV. Hierarchical Cooperation in LOS 

ENVIRONMENTS 



The proof of Theorem 13.11 relies on the following lemma 
which establishes a lower bound on the capacity of a MIMO 
transmission between two clusters of nodes. For notational 
simplicity, in the sequel we assume that all the distances 
in the network are normalized with respect to the carrier 
wavelength A. Note that when the distances are expressed 
in wavelength units, the channel model in (0, (01 takes the 
simplified form, 



H ik = y/G 



G = 



Gtx ■ Grx 

16^2 



(7) 



Lemma 4.1: Consider two square clusters of area A c sep- 
arated by a distance d (see Figure [U, with each cluster 
containing M nodes distributed uniformly at random over A c . 
Let \/A~ c < d < A c , and the nodes in the transmit cluster Dt 
perform independent signalling with power Pq /M such that 



GP 



> OdB. 



(8) 



N Wd 2 

Then the capacity of the MIMO channel from the transmit 
cluster Dt to the receive cluster Dr is lower bounded by 

^0 1 



G 



MIMO 



logdet / 



> K 3 min M 



NqW m 
A c /d 
log(A c /d) 



with high probabilityfor some constant K3 > independent 
of M, A c and d. 

The lemma is the analog of Lemma 4.3 in H which lower 
bounds the capacity of a MIMO transmission between two 
clusters of nodes under the i.i.d. phase model. With i.i.d. 
phases, the capacity of the MIMO transmission scales linearly 
in M. The condition (HJ ensures that the MIMO transmission is 
not power limited. For the LOS case, we have the additional 



term 



A c /d 



With probability 1 as n — > 00. 



\og(A /rf) 1 wniC h corresponds to the spatial degrees of 
freedom between the two clusters. When this term is smaller 
than M, the capacity of the MIMO transmission is not any 
more linear in Al. This in turn degrades the performance of 
the hierarchical cooperation scheme which is based on such 
MIMO transmissions. 

The capacity of a MIMO transmission between two clusters 
under the current LOS channel model has been investigated 
earlier in J8). The result stated in Theorem 1 of [8] is equiva- 
lent to Lemma HTTI above. However, the proof of Theorem 1 in 
(81 is based on an approximation which is not fully justified. 
Through private communication, we have been informed of a 
follow-up work |9| by the same authors, that similarly to our 
current paper investigates the performance of the hierarchical 
cooperation scheme under the LOS channel model. 

Next, we investigate the performance of the hierarchical 
cooperation scheme and show how Lemma 14.11 allows to 
prove the result in Theorem 13. II The core of the proof is the 
following recursion lemma. 



Lemma 4.2: Consider a network of n nodes uniformly 
distributed over an area Aq > n and the available power P per 
node satisfies ([6j. Assume that there exists a communication 
scheme for this network that achieves an aggregate throughput 

T > Ka min(n, \/%) b 

with high probability for some < b < 1 and a constant 
K4 > independent of n and Aq. 

Then, we can construct another scheme for this network that 
achieves a higher aggregate throughput 

T > K 5 min(n, \/~A )^- ei 

with high probability for any e\ > and a constant A'5 > 
independent of n and Aq. 

As soon as we have a scheme to start with, Lemma 14.21 
can be applied recursively, yielding a scheme that achieves 
higher throughput at each step of the recursion. Note that 
— jrg > b for < b < 1. We first show that a simple time- 
sharing strategy between the source-destination pairs (TDMA) 
satisfies the conditions of the lemma with 6 = 0. Note that with 
TDMA, each source node transmits only a fraction 1/n of the 
total time of communication. Hence when active, each source 
node can transmit with elevated power nP and still satisfy 
its average power constraint P. This yields an SNR larger 
than SNR; in (O for each transmission, hence a constant rate. 
Therefore, the aggregate throughput achieved by TDMA is 
constant independent of n and Aq. 

Starting with TDMA, 6 = 0, and applying Lemma 14.21 
recursively h times, we get a hierarchical scheme that achieves 
an aggregate throughput of order min(n, ^/]4o) 7^ + T ~ el for any 
e[ > 0. Therefore given any e > 0, we can choose e[ = e/2 
and h such that > 1 — e/2 and we a get a scheme that 
achieves the performance in Theorem 13. II □ 

Proof of Lemma \4~2\ We will prove the lemma by concen- 
trating separately on the two cases Aq > n 2 and n < Aq < n 2 . 
In the first case, we provide a brief overview of the three-phase 
scheme from Lemma 3.1 in |4| and verify that it achieves the 
same performance in |4j under the current deterministic phase 
model. The reader should refer to (4] for a precise analysis. 
For the case n < Aq < n 2 , a modification of the scheme is 
required to achieve the performance given in Lemma |4~2l 

A. A > n 2 

Let us divide the network into square clusters of area A c . 
Each cluster contains approximately M = nodes. A 

particular source node s sends M bits to its destination node 
d in three steps: 

(51) Node s first distributes its M bits among the M nodes 
in its cluster, one bit for each node; 

(52) These nodes together can then form a distributed transmit 
antenna array, sending the M bits simultaneously to the 
destination cluster where d lies; 

(53) Each node in the destination cluster observes the MIMO 
transmission in the previous phase; it quantizes each 
observation to Q bits, with a fixed Q, and ships them 



to d, which can then do joint MIMO processing of all 
the quantized observations and decode the M transmitted 
bits from s. 

From the network point of view, all source-destination pairs 
have to eventually accomplish these three steps. Step 2 is long- 
range communication and only one source-destination pair can 
operate at a time. Steps 1 and 3 involve local communication 
and can be parallelized across clusters. 

Since there are M source nodes in every cluster, this gives 
a total traffic of exchanging M(M — 1) ~ M 2 bits inside 
each cluster in phase 1. We can handle this traffic by setting 
up M sub-phases, and assigning M pairs in each sub-phase 
to communicate their 1 bit. The traffic to be handled at each 
sub-phase is similar to our original network communication 
problem with n users on an area Aq, but now instead, we 
have M users on area A c . We handle this traffic using the 
communication scheme given in Lemma l4~2l Note that if this 
scheme achieves an aggregate throughput K4 min(n, \/~An) b 
in the network of n nodes and area Aq, it will achieve an 
aggregate rate K4 min(A/, ^J~A^) b inside the clusters of M 
nodes and area 

A C E 

This can be verified by checking that the 
clusters of M nodes and area A c satisfy the conditions of the 
lemma. We have A c > M for the clusters if Aq > n for the 
original network and 

CP 

SNR Z (M, A c ) = M = SNR* > OdB 

iVo W A c 

if P satisfies ©. Moreover when Aq > n 2 , we have 
A c > M 2 , so the performance of the scheme is 
Ki min(A'/, \fA~ c ) b = M b . The traffic in the third phase is 
handled similarly to the first phase. Then, we need: 

• M 2 ~ b /K4 time slots to complete phase 1 all over the 
network; We handle the traffic in M subphases, each 
subphase is completed in M 1 ~ b / K4 time-slots. 

• n/K 3 time-slots to complete the successive MIMO trans- 
missions in the second phase, the distributed MIMO 
transmissions between any two clusters can achieve a 
rate of K3M bits/time-slot; We perform one MIMO 
transmission for each of the n source-destination pairs 
in the network. 

• QM 2 ~ h I K3K4 time slots to complete phase 3 all over 
the network; The traffic in the third phase is symmetrical 
to the traffic in the first phase, but larger by a factor 
of Q/K3. This factor comes from the fact that each 
MIMO transmission lasts I//V3 time slots, and each of 
the corresponding 1/K 3 observations is quantized to Q 
bits. 

In 1H, it is shown that each destination node is able to 
decode the transmitted bits from its source node from the 
M quantized signals it gathers by the end of Phase 3. Thus, 
the aggregate throughput achieved by the scheme can be 
calculated as follows: each source node is able to transmit M 
bits to its destination node, hence nM bits in total are delivered 

2 We ignore the performance loss due to inter-cluster interference since it 
does not change the scaling law. The reader is referred to (6) for details. 



to their destinations in M 2 ^ h /K i + n/K 3 + QM 
time slots, yielding an aggregate throughput of 

nM 



2-6 



/K 3 K 4 



M 2 ~ b /K 4 + n/K 3 + QM 2 ~ b /K 3 K 4 



bits/time-slot. 



Choosing M = n 2 - b to maximize this expression yields an 
aggregate throughput T = K^n 2 ^ for a constant A'5 > 0. 

Note that this throughput can only be achieved if the MIMO 
transmissions in phase 2 achieve a rate linear in M. The rate of 
the MIMO transmissions are lowerbounded in Theorem l4.1l for 
the deterministic phase model under certain conditions. The 
cluster areas and the separation between the clusters should 
satisfy the condition \f~A~ c < d < A c and the users should 
transmit with power satisfying condition (0. It is easy to 
verify that \J~A~ C < d < A c . Note that \[A~ C < d is always 
true unless the communicating clusters are neighbors!! Let us 
verify that the power condition <[8J f° r the MIMO transmission 
can be satisfied under the average power constraint P per node 
satisfying (|6]). In the second phase, the MIMO transmissions 
between clusters are performed successively and each node in 
the network transmits only M /n of the time. Therefore when 
active, each node can transmit with elevated power riP/Mand 
still satisfy its average power constraint P. Observe that if 
Po = nP, the condition ^ is satisfied given (O and the fact 
that d < \f~Ao~. 

Therefore, Theorem 14. 1 1 lowerbounds the rate of the MIMO 
transmissions in the second phase. The lower bound is linear 
in M if lo ^) d) > M. If A > n 2 , using A c = MM an d 
CA~q, we obtain for sufficiently large M, 



d > 



A c /d 



> 



M^/A^/n 



\og{A c /d) ~ log(A/V^/n) 



> M 



l-ei 



for any e\ > 0. The e\ is introduced to compensate for the 
logarithmic term and in turn yields an n~ £l degradation in the 
overall throughput as stated in Lemma l4~2l This concludes the 
proof of the lemma for networks with Ao > n 2 . 

B. n < A a <n 2 

In the case n < Aq < n 2 , the proof of the lemma 
differs from the earlier case Ao > n 2 in two aspects. When 
n < Ao < n 2 , the MIMO transmissions between the clusters 
are limited in spatial degrees of freedom. More precisely, in 
Theorem l4.ll the performance is lower bounded by the second 
term 



A c /d 



loo-(Ajd) an< ^ ^ * s not an y more linear in M. This fact 
requires a modification in the operation of this phase. 

The second difference is the following: We have seen that 
when Ao > n 2 for the original network, we have A c > M 2 
for the smaller clusters. In other words, when the network is 
not spatial degrees of freedom limited at the largest scale, it 
is not spatial degrees of freedom limited at any scale. In the 
current case, when 77. < Ao < n 2 , the network is limited in 
spatial degrees of freedom at the largest scale, but the smaller 
clusters may or may not be spatial degrees of freedom limited. 

3 The special case of neighboring clusters is excluded from the current 
discussion and can be handled separately as in |4]- 



More precisely, for a cluster of smaller size, we can either 
have M < A c < M 2 or A c > M 2 . This fact requires a more 
careful analysis. In particular, we separately consider the two 

2(4-b) 2(4-b) 



cases n < Ao < 



and n 5 - 2b < Ao < 



1) n 5 - 2b < Ao < n : As before, we divide the network 
into clusters of area A c that contain M = nA c /Ao nodes 
and the goal again is to accomplish steps S1-S2-S3 for every 
source-destination pair in the network. We choose the cluster 
size in the following particular way, 



M 



i 5=5 A, 



2(2-6) 



(9) 



This is a valid choice in the sense that M < n, in particular 

2 for the network. The 



> M . Therefore 



M < n 2 - b given the condition Ao < 

2(4-6) 

condition n 5 - 2i> < A ensures that A, 
as before, the scheme given in the hypothesis of Lemma 14.21 
achieves an aggregate throughput K\ min(M, \[A~ c ) h = M b 
when used inside the clusters of area A c and number of nodes 
M. We use this scheme to handle the traffic inside the clusters 
in phases 1 and 3 as before. In the second phase, the MIMO 
transmissions achieve a rate 



A c /d 



> 



A C /VA 



log(A c /d) ~ \og(A c /V^Y 

This implies that in the second phase, the MIMO transmissions 
for each source-destination pair can not be completed in 
constant number of time-slots as before. In order for these 
MIMO transmissions of lower rate not to result in too many 
MIMO observations in the third phase containing a small 
number of degrees of freedom, we introduce the following 
modification to step (S2). Let 



M' 



A c l^fAo 



(10) 



iog(A c /VA)) 

We randomly divide the M nodes in the source cluster to 
M/M' groups each containing M' nodes. We do the same 
division also in the destination cluster. We randomly associate 
one-to-one the M/M' groups in the source cluster with the 
M/M' groups in the destination cluster. The earlier M x M 
MIMO transmission between the source and the destination 
cluster is now divided into M/M' successive MIMO transmis- 
sions, each of size M' x M'. In each of these M' x M' MIMO 
transmissions, a group of M' nodes in the source cluster are 
simultaneously transmitting their bits to their corresponding 
group in the destination cluster. Note that these M' x M' 
MIMO transmissions are not limited in spatial degrees of 
freedom, precisely due to our choice for M' in ([Tol l. We will 
later verify that these M' x M' MIMO transmissions achieve 
a rate K 3 M'. If this is the case, we need: 

• M 2 ~ b '/K4 time slots to complete phase 1 all over the 
network; 

■ n X M/M' x I/-K3 time-slots to complete the succes- 
sive MIMO transmissions in the second phase, the 
distributed M' x M' MIMO transmissions between any 
two groups can achieve a rate of K 3 M' bits/time-slot; 



• QM 2 "/K^Ki time slots to complete phase 3 all over 
the network; Note that although each cluster receives 
M x M /M' MIMO transmissions in total, M/M' MIMO 
transmissions per each destination node in the cluster, 
each node has one MIMO observation of duration 1 / K 3 
time-slots for each of the other nodes. The modification 
in the second phase is precisely made to ensure this fact. 

Thus, the aggregate throughput achieved by the scheme is 
given by 

6 3 n.M 

(11) 



M 2 ~ b /K 4 + nM/M'K 3 + QM 2 - b /K 3 K 4 

bits per time-slot. It can be verified that for the choice of the 
cluster size in ||9), we have 

M2 _ b = nM 

The three terms in the denominator of (fTTT i are order-wise 
equal or in other words, (0 is the cluster size that maximizes 
the throughput expression in dTTT >. This yields an aggregate 
throughput 

K 5 M' = K 5 - 



T 



A -El 
^0 



K 5 n^ 



2(2 M 



A-Bl 



for a constant K$ > and for any e\ > 0, which is introduced 
to compensate for the logarithmic term in (TTOb . It can be 
verified that when A < n 2 the above throughput, 

T > K 5 (^)^- £1 

which is the performance claimed in the lemma. 

It remains to verify that we can achieve a rate K 3 M' in 
the M' x M' MIMO transmissions between the two clusters 
of area A c . Note that since the M' nodes in each group are 
chosen randomly among the M nodes in each cluster, without 
any consideration on node locations, they are uniformly and 
independently distributed over the area A c . It can be readily 
verified that the condition \[A~ C < d < A c in Theorem 14.11 
is satisfied. It remains to verify that we can transmit with 
power Pq/M' such that Pq satisfies (0. Note that due to the 
extra time division between the M/M' distinct groups in each 
cluster, each node is transmitting in only M' /M of the total 
transmission time of the cluster. On the other hand, due to the 
time sharing between the clusters in the second phase, each 
cluster is only active in a fraction M/n of the total completion 
time of the phase. Therefore during the M' x M' MIMO 
transmissions, the nodes in the transmit group can transmit 
with elevated power nP/M' and still satisfy their average 
power constraint P. This, in turn, means that they can satisfy 
the power requirement (O in Theorem 14.11 



2(4-6) 

2) n < An < n s- 2b 
area as 



In this case, we choose the cluster 



Ar 



(12) 



2(4-6) 

For this choice, the current condition n < Aq < n 5 - 2i > on 
the network gives M < A c < M 2 . This implies that, the 
scheme given in the hypothesis of Lemma l4~2l can now achieve 
an aggregate throughput K4 min(M, ^/A^) b = (-y/A^) 6 when 



used inside the clusters of area A c and number of nodes M. 
Applying exactly the scheme in the earlier case (1), we now 
get an aggregate throughput 

nM 

M 2 A~ b/2 /K 4 + nM/M'K 3 + QM 2 A~ b/2 / 'K 3 K 4 ' 

The three terms in the denominator of this expression are 
order-wise equal for the cluster area given in ( TT2b . Therefore, 
the throughput achieved is given by 

2 + i> , 

K 5 M' = K*. A^ 



T 



I 2(4-6) A- £l 
i A n 



A y- 



for a constant K5 > and any e\ > 0. The last inequality 
follows from the fact that < b < 1. 

Combining the conclusions of Sections IIV-AI and IIV-BI 
above completes the proof of Lemma POl □ 

Appendix A 
Proof of Lemma|4~T1 



Lemma 14.11 will be proven in two steps. We first lower 
bound the expected capacity of the MIMO channel over 
random node positions and then show that for a random real- 
ization of the node positions, the capacity of the corresponding 
MIMO channel is not that different from its expected value. 
We formally state these two results in the following lemmas. 

Lemma A.l: The expected capacity Cmimo of the MIMO 
channel in Lemma 14.11 is lower bounded by 

HCmimo) = EQogdct (/ + (P /M) HH^)) 

> A 3 mm M, 

V log(A c /d), 

for a constant K 3 > 0, where the expectation is taken over the 
independent and uniform distribution of node positions over 
the transmit and receive domains of area A r . 
Lemma A.2: Let s 



min(M, I ^ 3y ),foranyi>0 



P(|C. 



MIMO 



E(C. 



MIMO ) 



>t)<e~ 



Choosing t = s 1 / 2 + £ 2 ; e 2 > 0, the probability in the second 
lemma decreases to zero for increasing s. This implies that 
the deviations of Cmimo from ~E(Cmimo) are > at most, of 
the order of ^J~s. Therefore combining the results of these two 
lemmas yields the result given in Lemma |4~T1 In the sequel, we 
prove Lemma I A. 1 1 The proof of Lemma IA.2I closely follows 
the proof of Proposition 5.2 in |2| and is skipped due to space 
limitations. 

Proof of Lemma \A.1\ For notational convenience, we start 
by defining 



fik = — e 



a 3 2ir\\x k -Wi\\ 



(13) 



\\x k - Wi\\ 

where denotes the distance between the nodes k € T>t and 
i G T>r located at positions x k and Wi respectively . Note that 
d < r lk < d(l + 2yJ2A c /d), and therefore 

c < (1 + 2^/2A~/d)- 1 < \f ik \ < 1, 



(14) 



where cq := (l + 2v / 2)~ 1 and the first inequality follows from 
the fact that ^J~A C < d. 

The first ingredient of the proof of Lemma |AT| is the Paley- 
Zygmund inequality used in [4| to prove Lemma 4.3. We have 

P) 1 



E(CWo) = Ehogdet 

= E ^logdet ^1 
= ME (log ( 1 
> M log | 1 



-PP f 



N W m 



N Wd 2 M 
GP 



N Q Wd 2 
GP 



A 



N Wd 2 



> 



for any t > 0, where A is an eig envalue of (1/M) FF f picked 
uniformly at random. By Paley-Zygmund's inequality, if < 
t < E(A), we have 

GPo A (E(A)-t) 2 



E(Cmimo) > M log ( 1 
Given ( TT3l . we have 



E(A 2 ) 



M 

_E(tr( J FFt ) ) = _^ E( | /ife |2 ) > c 2. 

E(A 2 ) = — 3 R(tr(FF*FF*)) 
1 M 

= "/Tf3 E ^(fikf*kflmfim) 
M 

^ 2 + i73 E nUkft k hmf; m )<2+Ms 



E(A) 



where the last inequality follows from the upper bound in (IT4l) . 
5 = \M,(f aa fbafbb fab)\ where a, 6 are two different indices 
(notice that S does not depend on the specific choice of a and 
b). See Figure |2] Choosing then t = Cq/2, we obtain 

E( Cj „ MO )>(Mc 3 /4)lo g (l + ^) 5T L_ 
> Ko min [ M, - 



for a constant .Kg > independent of M and S if 



GP 



> OdB. 



N Wd 2 

The quantity 5, which takes values between and 1, dictates 
therefore the capacity scaling. In the case where the channel 
matrix entries fik are i.i.d. phases, S = 0, so the capacity 
E(Gm/mo) is of order M. At the other end, if we consider 
the LOS channel model in ([T31 in the scenario where nodes 
are placed on a single straight line, then a simple computation 
shows that S = 1, so that E(Cmimo) is of order 1 (in this 
case, we know that the matrix F is also rank one, so the lower 
bound matches the upper bound on the capacity, up to a log M 
term). The problem we are looking at lies between these two 



x b 






• — 

fal> 








faa 








Or 



Fig. 2. S=\E(f aa f b * a f bb f* b )\ 




s/Az 



-\/~Ax 



\/~Au 



Fig. 3. Coordinate system. 

extremes. Our aim in the following is to show that if both 
A and d grow large and \[A~ C < d < A c , then there exists 
> independent of A c and d, such that 

This implies that 

E(C M imo) > K3 min ( M, 

which completes the proof. 

The rest of the section is devoted to proving (Tl3T >. Let us 
first explicitly write the expression for S. We have 

S = \^{faafbafbbfab)\ 



A c /d 



-77 I dx n 



dx b / dw a / dw b p e? ' 2lx A 
Jd r Jd r 



(16) 



where 

A = \\x„ 



w b \ 



\x b - w b \ 



p=d(\\x a - W a \\\\x a - W b \\\\x b - W b \\\\x b - W a \\) 



x b - w a \\, 
(17) 

-1 



.(18) 



We first derive the result ( Tl3T > by approximating the distance 
in ( foi l in the regime \/~A c <C d <C A c . This approximate anal- 
ysis captures most of the intuitions for the precise derivation 
which is given afterwards. Consider two nodes at positions 

x = {—y/A^x, \[A~ c y) £ Dt and w = (d + ^/A^w, yjA~ c z) £ 
Dr, where x,y,w,z £ [0,1] (see Figure 0. Using the 
assumption that d ^> \f~A~ c , we obtain 



\\x-w\\ = \J{d+ yfA~ c (x + w)) 2 +A c (y- zf 



d+ v / A c {x + w) + ^(y-z) 2 



which in turn implies 

A = \\x a - W a \\ - \\x a - W b 

A 



X b - W b \\ - \\X b ~ 10 a I 
2 /„, ., \2 , („, „ \2 /„, „ \2\ 



, {{y a - z a ) - (y a - z b ) + (y b - z b ) - (y b - z a ) ) 

= Tfa- Va) ( Z b - z a) 



dz b (.-J^tI 1 (*»-»•)(*»-*.) 



Next, let us also make the approximation that p ss 1 in < f]~8T >: 

this is actually assuming that the spatial degrees of freedom 

between the two clusters are mainly determined by the phases 

of the channel coefficients and not so much by the amplitudes. 

We will see below that this intuition is correct. 

These two successive approximations lead to the following 

expression for S: 

S ps So 

-i pi pi pi 

dy a / dy b / dz a I 
Jo Jo Jo 

dy a I dy b [ dz a [ d Zb e- j2 ^ {vb - ya){Zb - Za) 

10 Jy a Jo Jo 

where the second equation follows from the symmetry of the 
integrand. Note that this expression does not depend on the 
horizontal positions of the nodes. This can be interpreted as 
follows. Provided the above approximation is valid, the MIMO 
capacity scaling between two clusters of M nodes separated 
by a distance d ^> \f~A~ c is the same, be the nodes uniformly 
distributed on two squares of area A c or on two parallel 
(vertical) lines of length \f~A c . This result is of interest in 
itself and can be proven rigorously. 

We show below that the above integral is indeed of order 
d/A c . Let us compute the first integral, which yields 
• i 

'z b e 



Proof of Inequality (fT31 ): We start again with the expression 
for S in ( [ToT l. Note that due to the symmetry of A and p in 
w a and w b , we can upper bound ( [ToT l as 



dZl P^ 2 ^ 4f (Vb-Voi) (z b -z a ) 



j2nA c (y b - y a ) 
This implies that 



z b = l 
z b =0 







<Ka — 



A c \y b - y a \ 



for a constant K§ independent of A c and d. We can divide the 

integration over y a and y b into two parts, 

,1 ,i ,i ,i 

dy a / dy b / dz a / dz b e -J^^( Vb - Va )(z b -z a ) 

10 Jy a Jo Jo 

/ /•! r(»a+ea)Vl /-l-£3 A \ 

= dy a dy b + dy a I 

\Jo Jy a Jo Jy a +s 3 ) 

x dz a dz b e -i2f-# (»>-».) 

JO JO 

for any < £3 < 1. The first term can be simply bounded by 
£3, which yields the following upper bound for So 



S < 2e 3 + 2K 6 — / dy a 

Ac Jo Jy a +e 3 



dyb 



1 



m - y a \ 



< 2s 



2K 6 — Iog(l/e 3 ) 



So choosing 63 = d/A c , we finally obtain 



S « So < K'i — log(A c /rf) 



S<^ 
-Af 



dx a / dx b 

D T Jd t 



dw ■ 



a j 27r(||a! a -«)||-||a!b-tu|[ 



\X a - W 



\X b 



W 



Expressing this upper bound more explicitly in the coordinate 
system in Figure [3] we obtain the following upper bound for 

S, 



for a constant independent of A c and d. We will next 
prove ( fT3T > without making use of the above approximations. 



where 



1 pi pi pi 
dx a / dy a / dx b / dy b 
Jo Jo Jo 



r 1 e 32 " 9a,b(w,z) 

dw / dz— — r 

Jo (*a,b{w, ~ 



(19) 



g a . b (w,z) =\J (d+ \pV c (x a + w)) 2 + A c (y a - z) 2 

- \l(d+ y/A~ c (x b + w)) 2 + A c (y b - z) 2 . 

and 

G a>b (w,z) = d- 2 \j (d + y^A~ c (x a + w)) 2 + A c (y a - z) 2 

x yj(d + \J~A c {x b + w)) 2 + A c (y b - z) 2 . 

Let us first focus on the integral inside the square in ( fT9b . 
The key idea behind the next steps of the proof is contained 
in the following two lemmas. 

Lemma A3: Let g : [0, 1] — >• R be a C 2 function such that 
> ci > for all z <E [0, 1] and g" changes sign at most 
twice on [0, 1] (say e.g. g"(z) > in [z_,z + ] and g"(z) < 
outside). Let also G : [0, 1] — > R be a C 1 function such that 
\G{z) I > C2 > and G'(z) changes sign at most twice on 
[0,1]. Then 



dz ■ 



o3 2ng(z) 



G(z) 



< 



14 



7T Cl C2 

Lemma A.4: Let g : [0, 1] — > R be a C 2 function such that 
there exists zo G [0, 1] and c\ > with |.g'(z)| > c\ \z — Zq\ 
for all z E [0, 1] and g" changes sign at most twice on [0, 1]. 
Let also G : [0, 1] -> R be a C 1 function such that \G(z)\ > 
C2 > and G'{z) changes sign at most twice on [0,1]. Then 



dz ■ 



j1ir g(z) 



< 



14 



7TCiC 2 



jo G(z) 

The proof of Lemma IA.3I is relegated to Appendix [B] The 
proof of Lemma IA.4I follows the same lines and is omitted 
due to space limitations. 

Let now £3 > and let us divide the integration domain 
(x a ,x b ,y a ,yb) G [0, l] 4 in (TT9l l into three subdomains (see 
Figure 0): 



Ui - 
U 2 = 
U 3 - 



[\y a - Vb\ - {\fA~c/d) \x b - x a \ > e 3 1 
{0 < \y a - Vb\ - (\/A~c/d) \x b - x a \ < £ 3 1 
Uy a - Vb\ < (VA~c/d) \x b -x a \j 



Consider first the integral over U\. It can be verified from the 
expression d23l for the first order partial derivative of g a b with 







• 












u 3 J>«^' 












Ui 





Fig. 4. Domains of integration: the relative positions of the points x a and 
xij determine in which domain one is (Ui on the figure). 



respect to z given in Appendix iBl that if (x a ,x b , y a ,yb) € U\, 
then 



dz 



\w,z) 



> K 7 (\y b -y a \- \x b - x a \ 



for a constant Kj > independent of A c and d. Notice next 
that \G a ,b{y, z)\ > 1. It can further be checked that both 
9 gj2 ,b (w, z) and dG d a z b (w, z) change sign at most twice on the 
interval z € [0, 1] (for w fixed). Therefore, applying Lemma 
IA.3I we conclude that 



dw I dz — - — — — 

JO <^a,b{Z) 
d 

<K R — 



< I dw 
'o 



e i27rg aib (u),z) 

) G a , b (y,z) 



1 



A c 1 2/6 - y a \ - (y/Ac/d) \x b - x a \ 

Since we know that this integral is also less than 1, this in 
turn implies 

dx a dx b dy a dy b 



Ui 



r 1 e j27r 9a,b(w,z) 
dw / dz— — — 



< K s — I dx a dx b dy a dy b 



c JUi 



\yb - y a \ - (vAc/d) \x b - x a \ 



: K 8 — Iog(l/e 3 ) 

Second, it is easy to check that 

l p i 

dx a dx b dy a dy b 



u 2 



e j2n 9a,b(w,z) 

dw I dz ~F — i — r 

Jo G a , b (w,z) 



< 2e,. 



The integral over the third domain of integration <7 3 is more 
delicate. Notice first that the obvious bound 

2 



dx a dx b dy a dy b 
i u 3 

allows to obtain 



f 1 e j 2jt g a ,b{w,z) 

dw / dz— - — — 

Jo G a , b {w,z) 



< 2- 



S<K 8 ^- log(l/ £3 ) + 2e 3 + 2 ^ 

which can be made smaller than K 3 (d/A c ) \og{A c /d) by 
choosing £3 = d/A c when A Z J 4 < d < A c (as \[A~ c jd < 
d/A c in this case). 



For the remainder of the proof, let us therefore assume that 
\[A~ C <d< Al /4 . As before, we focus on the integral inside 
the square in the following term 



dx a dx b dy a dy b 



f 1 e J'2ir g a , b {w,z) 

dw / dz— — — 

Jo G a , b {w,z) 



(20) 



Let us start by considering the simplest case where the points 
x a and x b are located on the same horizontal line, i.e. y a = 
y b . In this case, the second term in the expression (l23l l for 
^?^-{w,z) becomes zero, so we deduce the following lower 



bound: 



dz 



(w,z) 



A 



3/2 



d 2 



\x b - x a \ \z - y a \ 



This, together with the above mentioned properties of the 
functions g a j, and G a . b , allows us to apply Lemma IA.4I so 
as to obtain 



1 f 1 e 3 2 K g a , b (w,z) 

dw / dz— — - 

JO G a , b {w,Z) 



< K 



1 



10 



Al /4 y/\x b - x a 



for a constant K\o > independent of A c and d. A slight gen- 
eralization of this argument (see AppendixlBlfor details) shows 
that not only when y a = y b but for any {x a , x b ,y a ,y b ) € U 3 , 



cJ^ti g a ,b(w.z) 



we have 








f dw f 




Jo Jo 



G a ,b{w, z) 

d 

< K 10 
<K 10 



1 



AT fab ~ ^a) 2 + (2/6 - 2/a) 2 ) 1 / 4 
y/\x b -X a \ 

Since we also know that the above integral is less than 1, we 
further obtain 

f 1 e J'2ir g a , b (w,z) 2 f d 2 1 

dW dZ G h (w z) - lain ( Kl0 ^sj2 \ Xh - x I ' 

JO ^a,b\W,Z) \ A c ' \X b — X a \ 

For any < 77 < 1, we can now upper bound d20t as 



dx a dx b dy a dy b 



f 1 e j2n ga,b(w,z) 

dw / dz— — 

Jo & a ,b{w,Z) 



< \U 3 f]{\x b - x a \ < 17} I 

f d 2 1 

+ K w / dx a dx b dy a dy b 

Ju 3 n{\x b - Xa \> v } Aj \Xb-Xa\ 

<2rj + K w ^ log(l/r/) = 2r, + K 10 ^ log(l/Tj) 



-4 



implying that 



S < K 8 4~ lo s(V£3) + 2e 3 + 2r? + K w log(l/7?) 

Choosing finally e 3 = r\ = d/A c allows to conclude 
that S < K (d / A c ) \og(A c / d) also in the case where 



<Ar < d < A 



3/4 



□ 



Appendix B 
Technical details 



Proof of Lemma IA.il By the integration by parts formula, 
we obtain 



1 e j2ng(z) 

dz ■ 



1 dz J' 2 ^) e ^W 
o j2ng'(z)G{z) 



G{z) 

^ 2 * 9{z)) 1 _ f 1 dz g"(z)G(z)+ g i(z)G'{z) pj 2T9(2) 
o Jo 



j27rg'(z)G(z) 



j2ir(g'(z)G(z)) 2 



which in turn yields the upper bound 



e j2-Kg(z) 

^^W 

1 



< 



1 



1 



2tt V 1^(1)110(1)1 |<?'(0)||G(0)| 



dz ■ 



9"(*)\ 



dz ■ 



\G'(z)\ 



/ {g'(z)Y\G{z)\ J g'(z)(G(z))^ 
By the assumptions made in the lemma, we have 



dz 



WW 



< 



1 



(g'(z)) 2 \G(z)\ ~ c 2 J {g'{z)Y 



dz 



\g"(z)\ 




dz 



{9'{z)y 



c 2 U'(l) g'(0) g'(z-) g'(z + ) 



So 



\g"(z)\ 



< 



6 



/ {g'{z)f\G{z)\ ~ c lC2 
We obtain in a similar manner that 



dz 



\G'(z)\ 



o 9'{z){G{z)y 



< 



G 



c\ c 2 



Combining all the bounds, we finally get 



e j2-!rg(z) 



< 



14 



7TCi C 2 



□ 



Expression for the first order derivative of g a .b(w, z ) : K can 
be verified that 

g a ,b( w i z ) = 



(d/ v^?c + a; + w) (ia; 



\/(d/7X +x + w) 2 + {y a - z) 2 

i A i (y- z ) d v 



(22) 



'3/a VWV^ +%b + W) 2 + {y- z) 2 

So the expression for the first order partial derivative of 
ga,b(w, z) with respect to z is given by 

rXt (z~y a )(d/y/A~ c + x + w) dx 



dg a ,t 



dz 



(w,z) 



'A, 



'A r 



*a ({d/y/AZ + x + w) 2 + (z-y a ) 2 ) 312 
{dj\[A~ c + x b + w) 2 dy 



va ((d/^jA~ c + x b + w) 2 + {z- y) 2 ) 



3/2 



(23) 




Fig. 5. Tilted reference frame. 

Proof of equation ( ETl ): In order to prove (fJTJ, we need 
to make a change of coordinate system, replacing (w, z) by 
(«/ , z'), where w' is now in the direction of the vector x a — 
Xb and z' is perpendicular to it (see Figure ). In this new 
coordinate system, the integral reads 



dw' dz' 



J 2-7rg a ,b{w' ,z') 



G a , b (w',z>) 

where g a .b(w' , z'), G a ,b(w',z') have the same form as 
g a ,b(w, z), G a ^(w, z), but now, the domain of integration D R 
is a tilted square, as indicated on the Figure [5] Using then the 
same argument as in the case y a = y^, we conclude that 



aj 2ng a , b {w' ,z') 



dw'dz' ■ 

D R G a ,b( w 



Noticing finally that la;^ 
allows to conclude ((21b . 



< K 



1 



10 



.4 



3/4 



J = V( X b - Xa) 2 + (y b - y a ) 2 
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